Comment on "Fast and accurate modeling of molecular atomization energies with machine learning".
نویسنده
چکیده
In a recent Letter [1], the authors construct a machine learning (ML) model of molecular atomization energies, which they compare to bond counting (BC) and the PM6 semiempirical method [2]. However, their ML model was trained and tested on density functional theory (DFT) energies while BC and PM6 are fit to standard enthalpies. For fair comparison, bond energies are refit to DFT data and PM6 is converted to an electronic energy using peratom corrections [3]. BC and PM6 both perform better than the ML model and are free of large outliers in their error distributions as shown in Fig. 1. As noted in [25] of [1], some ML model error may originate from the coordinate system choice. The n eigenvalues of the Coulomb matrix correspond to an equienergy 2n-dimensional space of n-atom molecules rather than one molecule. For n 1⁄4 3, this corresponds to the 3 translations and 3 rotations that naturally preserve the energy of an isolated molecule. For n > 3, the space includes unphysical molecular deformations that destroy structural rigidity. Figure 2 shows this with a distortion of acetylene (C2H2) that preserves its ML energy and coordinate, (53.058, 21.149, 0.290, 0.219). It is suggested in [25] of [1] that the n sorted entries of a Coulomb matrix might be utilized instead of its n eigenvalues as a ML coordinate system. This eliminates the dimensional deficiency, but produces identical coordinates for homometric molecules [5] that do not necessarily have equal energies. A computationally expensive alternative is the equivalence class of permuted Coulomb matrices with distance metric
منابع مشابه
Fast and accurate modeling of molecular atomization energies with machine learning.
We introduce a machine learning model to predict atomization energies of a diverse set of organic molecules, based on nuclear charges and atomic positions only. The problem of solving the molecular Schrödinger equation is mapped onto a nonlinear statistical regression problem of reduced complexity. Regression models are trained on and compared to atomization energies computed with hybrid densit...
متن کاملModeling of molecular atomization energies using machine learning
Atomization energies are an important measure of chemical stability. Machine learning is used to model atomization energies of a diverse set of organic molecules, based on nuclear charges and atomic positions only [1]. Our scheme maps the problem of solving the molecular time-independent Schrödinger equation onto a non-linear statistical regression problem. Kernel ridge regression [2] models ar...
متن کاملAssessment and Validation of Machine Learning Methods for Predicting Molecular Atomization Energies.
The accurate and reliable prediction of properties of molecules typically requires computationally intensive quantum-chemical calculations. Recently, machine learning techniques applied to ab initio calculations have been proposed as an efficient approach for describing the energies of molecules in their given ground-state structure throughout chemical compound space (Rupp et al. Phys. Rev. Let...
متن کاملMachine Learning Predictions of Molecular Properties: Accurate Many-Body Potentials and Nonlocality in Chemical Space
Simultaneously accurate and efficient prediction of molecular properties throughout chemical compound space is a critical ingredient toward rational compound design in chemical and pharmaceutical industries. Aiming toward this goal, we develop and apply a systematic hierarchy of efficient empirical methods to estimate atomization and total energies of molecules. These methods range from a simpl...
متن کاملLearning Invariant Representations of Molecules for Atomization Energy Prediction
The accurate prediction of molecular energetics in chemical compound space is a crucial ingredient for rational compound design. The inherently graph-like, non-vectorial nature of molecular data gives rise to a unique and difficult machine learning problem. In this paper, we adopt a learning-from-scratch approach where quantum-mechanical molecular energies are predicted directly from the raw mo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Physical review letters
دوره 109 5 شماره
صفحات -
تاریخ انتشار 2012